Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
También | 1049 | 42 | 1 | 42.0000 |
p. | 153 | 25 | 1 | 25.0000 |
En | 5719 | 190 | 8 | 23.7500 |
pero | 5172 | 139 | 6 | 23.1667 |
Pero | 2051 | 69 | 3 | 23.0000 |
La | 8785 | 609 | 27 | 22.5556 |
ciento | 1241 | 20 | 1 | 20.0000 |
Su | 691 | 60 | 3 | 20.0000 |
Y | 2432 | 78 | 4 | 19.5000 |
Los | 3552 | 262 | 14 | 18.7143 |
El | 10490 | 622 | 34 | 18.2941 |
Las | 1833 | 155 | 9 | 17.2222 |
Así | 1002 | 14 | 1 | 14.0000 |
pues | 604 | 28 | 2 | 14.0000 |
Sin | 1264 | 14 | 1 | 14.0000 |
Ahora | 601 | 26 | 2 | 13.0000 |
Cómo | 193 | 13 | 1 | 13.0000 |
A la | 167 | 13 | 1 | 13.0000 |
según | 1534 | 52 | 4 | 13.0000 |
Se | 1240 | 75 | 6 | 12.5000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
millones | 2350 | 11 | 162 | 0.0679 |
miles | 323 | 1 | 12 | 0.0833 |
capaz | 159 | 1 | 10 | 0.1000 |
creo | 480 | 2 | 19 | 0.1053 |
p.m | 122 | 1 | 9 | 0.1111 |
a.m | 105 | 1 | 8 | 0.1250 |
cree | 338 | 2 | 16 | 0.1250 |
conseguido | 116 | 1 | 8 | 0.1250 |
par | 200 | 1 | 8 | 0.1250 |
EE.UU | 139 | 1 | 7 | 0.1429 |
producir | 85 | 1 | 7 | 0.1429 |
convertirse | 101 | 1 | 7 | 0.1429 |
verse | 58 | 1 | 7 | 0.1429 |
afectada | 64 | 1 | 7 | 0.1429 |
participar | 162 | 1 | 7 | 0.1429 |
contar | 170 | 2 | 14 | 0.1429 |
exceso | 59 | 1 | 7 | 0.1429 |
individuo | 48 | 1 | 7 | 0.1429 |
famosa | 59 | 1 | 7 | 0.1429 |
tipo | 650 | 2 | 13 | 0.1538 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II